A Test to Determine the Multivariate Normality of a Data Set

نویسندگان

  • Stephen P. Smith
  • Anil K. Jain
چکیده

This correspondence describes a new test for multivariate normality useful in pattern recognition. The test is based on the Friedman-Rafsky multivariate extension of the Wald-Wolfowitz runs test. We perform Monte-Carlo experiments to determine if the test is reliable in high dimensions with moderate sample size. We compare the test to some other tests mentioned in the literature.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Canonical-Based Goodness-of-fit Tests for Multivariate Skew-Normality

It is well-known that the skew-normal distribution can provide an alternative model to the normal distribution for analyzing asymmetric data. The aim of this paper is to propose two goodness-of-fit tests for assessing whether a sample comes from a multivariate skew-normal (MSN) distribution. We address the problem of multivariate skew-normality goodness-of-fit based on the empirical Laplace tra...

متن کامل

A New Bootstrap Based Algorithm for Hotelling’s T2 Multivariate Control Chart

Normality is a common assumption for many quality control charts. One should expect misleading results once this assumption is violated. In order to avoid this pitfall, we need to evaluate this assumption prior to the use of control charts which require normality assumption. However, in certain cases either this assumption is overlooked or it is hard to check. Robust control charts and bootstra...

متن کامل

A new test for multivariate normality by combining extreme and nonextreme BHEP tests

In this paper we propose a new multiple test procedure for assessing multivariate normality which combines BHEP (Baringhaus-Henze-Epps-Pulley) tests by considering extreme and non-extreme choices of the tuning parameter in the definition of the BHEP test statistic. Monte Carlo power comparisons indicate that the new test presents a reasonable power against a wide range of alternative distributi...

متن کامل

A New Framework for Distributed Multivariate Feature Selection

Feature selection is considered as an important issue in classification domain. Selecting a good feature through maximum relevance criterion to class label and minimum redundancy among features affect improving the classification accuracy. However, most current feature selection algorithms just work with the centralized methods. In this paper, we suggest a distributed version of the mRMR featu...

متن کامل

Choosing the Best Hierarchical Clustering Technique Based on Principal Components Analysis for Suspended Sediment Load Estimation

1- INTRODUCTION The assessment of watershed sediment load is necessary for controling soil erosion and reducing the potential of sediment production. Different estimates of sediment amounts along with the lack of long-term measurements limits the accessibility to reliable data series of erosion rate and sediment yield. Therefore, the observed data of suspended sediment load could be used to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Pattern Anal. Mach. Intell.

دوره 10  شماره 

صفحات  -

تاریخ انتشار 1988